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======================================================================

** Conservation of Energy (atomic view)

*** Introduction

So far we have learned about conservation of energy
-- object view (objects treated individually; and as if they were particles)
-- system view (systems of objects are considered)

Now we wish to explore indivudal objects themselves, and treat them as
they are, as systems of particles at the atomic (and subatomic) level.

This will bring us in close contact with thermodynamics, the study
of the large-scale impact of atomic-scale energies.  We will be
studying thermodynamics more thoroughly later on.

The theory we will be presenting will mostly be based on classical
physics, although some discussion of quantum effects will occur.

We will also not be considering "field energy" at this point.

It is interesting to note that QM was first discovered in the
course of a study of thermodynamics.  This should make sense, given
what thermodynamics is all about.

======================================================================

*** Energy partitioning and normal modes

**** Translational, rotational, and vibrational energy

We have already learned that kinetic energy for a system can
be divided into a center-of-mass portion and a relative
portion:
  K_sys = K_cm + K_rel

It turns out that this subdivision can be extended:
  K_sys = K_tr + K_rot + K_vib
                 ^^^^^^^^^^^^^-- all part of K_rel

Each such category can be further subdivided into a contribution
to K from each individual mode.
  K_sys = sum_i K_tr,i  +  sum_i K_rot,i  +  sum_i K_vib,i

The vibrational modes, which inevitably involve bond deformations,
can also be associated with a potential energy, U_vib.

The total PE (U_vib) can also be subdivided into contributions
from each individual mode.
  U_vib = sum_i U_vib,i


What is a mode?

A mode (or "degree of freedom") represents a particular motion of the
system.

Motion can generally be subdivided into combinations of motion
associated with each mode.

The motion of a single particle system (in 3-D) can be broken down
into 3 modes, one for each coordinate direction:
  x (particle moves in x direction)
  y (particle moves in y direction)
  z (particle moves in z direction)
Any motion of this particle can be broken down into x, y, and z motion.

The fact that total kinetic energy can be partitioned into a
contribution from each mode boils down to:
  K = (1/2)mv^2 = (1/2)mv_x^2 + (1/2)mv_y^2 + (1/2)mv_z^2
Each term corresponds to the KE for each mode.

A two-particle system in 3-D consists of 6 modes:
  1x: particle 1 moves in x direction (particle 2 does not move)
  1y: ditto (y)
  1z: ditto (z)
  2x: particle 2 moves in x direction (particle 1 does not move)
  2y: ditto (y)
  2z: ditto (z)

All motions of this two-particle system can be broken down into these
6 modes.

For an N particle system, there are 3N modes, one associated with each
particle moving in each possible coordinate direction (with no other
particle moving):
  1x, 1y, 1z, 2x, 2y, 2z, 3x, 3y, 3z, etc...

If the N particles are confined to move in 1-D, then only N modes exist.


The modes described above work well for systems of atoms which move
independently (e.g., mono-atomic ideal gas).

When bonding is involved, it makes more sense to define modes in terms
of well-defined motions of the system, the so-called NORMAL modes.

The problem with mode 1x, for example, is that if the N-particle
system starts out moving entirely in that one mode (i.e., particle 1
moves along x; no other particle moves), it won't continue moving that
way.

This occurs because atoms that are bonded to particle 1 will get
dragged along, and start moving themselves.  Depending on bond
directions, there may also be y and z motion as well.

Such a "non-normal" mode is not stable -- it inevitably changes over time.

The normal modes, by contrast, are stable -- they do NOT change over time,
unless there are external interactions.

The normal modes for a fully bonded (i.e., solid) object include
overall (rigid) translational and rotational modes, as well as
vibrational modes which vibrate with a definite frequency.

For a solid object, the vast majority of normal modes are internal
vibrational modes.

There are only 3 translational modes and (up to) 3
rotational modes, regardless of how many atoms there are.

The remaining 3N-6 normal modes are vibrational.

We will spend the next several videos learning more about these
normal modes.

======================================================================

**** Modelling atomic bonds

It turns out that the behavior of bonded atoms is well modeled by
assuming that the atoms are connected by springs with some appropriate
spring constant.

These coupled "spring-mass oscillations" are what gives rise to the
vibrational modes of a system of atoms.

In this video, I will try to explain why this is a reasonable model.


Let's start with the simplest atomic system, a single hydrogen atom.

The hydrogen atom consists of an electron and a proton interacting with
one another via an attractive electric force.

The total energy of this system can be written:
  E_sys = K_cm + K_rel + U_el(r1-r2)

Here, K_cm represents the kinetic energy associated with the motion
of the atom as a whole (mostly the proton motion, since mp >> me).

Since our plan is to treat the atom as a single particle, it makes
sense to combine the K_rel and U_el terms into an internal energy
of the system,
  E_int = K_rel + U_el

The internal motions of the system are best calculated using the
rules of quantum mechanics (this will be explained MUCH later),
with the assumption that the proton (or more precisely, the cm)
doesn't move.

The so-called "energy levels" turn out to be quantized, and the
hydrogen atom actually spends virtually all of its time in the
"ground state" (lowest energy level).

It makes sense for us to set the reference configuration so that
  E_int = 0
(it is actually -13.6eV if the reference configuration has the
electron and proton both at rest infinitely far apart).

Thus, we only have to worry about the kinetic energy associated
with the motion of the atom overall:
  E_sys = K_cm

Thus, a single hydrogen atom can safely be regarded as a simple
particle.  Other atoms can be treated likewise.


Now consider a system with two hydrogen atoms.

This system consists of 2 protons and 2 electrons; each electron bound to
one of the protons.

The total energy of this system can be written:
  E_sys = K_1,cm + K_2,cm + K_1,rel + K_2,rel + U_el

Here, K_1,cm and K_1,rel represent the cm and rel portion of the kinetic
energy associated with atom 1 alone.  Same is done with atom 2.

U_el includes electric potential energies associated with e-e, p-e (4),
and p-p interactions.

Since our plan is to treat each atom as a single particle, it makes sense
to write the total system energy in this way:
  E_sys = K_1,cm + K_2,cm + U_eff
where
  U_eff = K_1,rel + K_2,rel + U_ep + U_ee + U_pp

Once again, the internal motions (mainly the electron motions) are
appropriately calculated using the rules of quantum mechanics, again
with the assumption that neither atom is moving overall (U_pp is
usually calculated separately at the end).

This system is more complicated, because the electrons and protons
making up each of the two atoms can interact with one another.

It follows that U_eff depends on the separation of the two atoms, and
thus acts as an "effective potential energy" for the two-atom system.

It makes sense to set the reference configuration (U_eff=0) to be
where the individual hydrogen atoms are infinitely far apart, and in
their respective ground states.

As the atoms are brought closer together, U_eff decreases because the
electrons will try arrange themselves between the protons for maximum
attraction.

The electrons actually end up being "shared" between the two atoms,
and hence this represents a covalent bond.

If the atoms are brought in too close, U_eff shoots back up again
(because of U_pp), and actually goes to infinity.

The overall graph looks like this
[draw graph U_eff(r); r = separation between atoms
 well depth = 4.8eV (binding energy is 4.5eV)
 equilibrium separation = 0.74 A = 0.074 nm]

The low point for U_eff represents the equilibrium separation between
atoms, and represents a natural bond length.

Moving the atoms farther or closer together increases the potential
energy, suggesting a restoring force.

At room temperature, the bond stretch/compress is usually quite small,
and so U_eff can be effectively replaced by a quadratic function
  [illustrate in diagram]

This suggests treating the bond as if the atoms were connected by
a spring.

Using the known parameters of this U_eff graph, it turns out
  k_eff = 3576 eV/nm^2 = 572 N/m
  omega = 8.280x10^14 rad/s  (f = 1.318x10^14 Hz)

Other bonds (covalent and ionic) can be treated in a similar fashion,
and are well modelled by assuming that the bonded atoms are connected
by springs (albeit with different k values).


For reference:

Hydrogen molecule potential energy curve (copyright status unknown):
  https://chemistry.stackexchange.com/questions/99852/hydrogen-molecule-potential-energy-graph
U_eq - U_inf = -4.8eV (binding energy = 4.5eV; accounts for zero-point energy)
eq occurs at separation = 0.074 nm
k_eff = 572 N/m (from 4D worksheet answer key)

additional info from 4D worksheet answer key regarding H2 bond:
U(r) = De*(1 - e^[-(r-ra)])^2 + U0
  De = -U0 = 4.8eV
  re = 0.074nm
  a = 19.3 nm^-1
k_eff = 3576 eV/nm^2 = 572 N/m
m_H = 938.8 MeV/c^2
mu = 469.4 MeV/c^2 (reduced mass)
omega = 2.760x10^-3 c/nm = 8.280x10^14 rad/s (f = 1.318x10^14 Hz)
hbar*omega = hf = 0.5446 eV
a0 = 8.726 pm  (ground state uncertainty in position)
T0 = 6320K  (mode freezeout temperature)

======================================================================

**** Small systems (1-D and 3-D)

A small system is one which consists of just a few particles.

Examples with up to 4 particles shown below.

Counting normal modes for these small systems is most appropriate when
dealing with an ideal gas consissting of (non-interacting) small
molecules.

Counting and identifying normal modes for these small systems
is also good practice for dealing with larger systems, more
appropriate for solid objects.

Liquids present their own issues...


Linear molecules (along x-axis)

[diagrams: organize by x,y,z and n = mode #]

In general, the amplitude relationships listed are strictly
valid only for the "x" mode (based on calculations in next video).

N=1 atom:  3 translational

n=0
x: right
y: up
z: out
All translational
A_0 = 1

N=2 atoms:  3 translational, 2 rotational, 1 vibrational

n=0
x: right - right  (translational)
y: up - up  (translational)
z: out - out (translational)
A_0 = A_1 = 1

n=1
x: left - right  (vibrational)
y: down - up  (rotational, z-axis)
z: in - out  (rotational, y-axis)
A_0 = cos(pi/4); A_1 = cos(3pi/4) = -A_0

N=3 atoms:  3 translational, 2 rotational, 4 vibrational

n=0
x: right - right - right (translational)
y: up - up - up (translational)
z: out - out - out (translational)
A_0 = A_1 = A_2 = 1

n=1
x: left - zero - right (vibrational, stretch)
y: down - zero - up  (rotational, z-axis)
z: in - zero - out (rotational, y-axis)
A_0 = cos(pi/6); A_1 = cos(3pi/6) = 0; A_2 = cos(5pi/6) = -A_0

n=2
x: right - left(x2) - right (vibrational, stretch)
y: up - down(x2) - up  (vibrational, angle bending)
z: out - in(x2) - out (vibrational, angle bending)
A_0 = cos(pi/3) = 1/2; A_1 = cos(pi) = -1; A_2 = cos(5pi/3) = -1/2

Note: angle bending modes for linear molecules are zero frequency in
the strict spring model that we are using.  We would need to use a
torsional spring model for the bond angle.

Angle bending modes can be modelled using simple springs if there is
an actual angle at equilibrium (e.g., see isosceles triangle, below).

Note the difference between bond stretching modes n=1 and n=2.
The scale of variation of motion for n=2 is shorter than n=1.

N=4 atoms: 3 translational, 2 rotational, 7 vibrational

n=0
x: right - right - right - right  (translational)
y: up - up - up - up  (translational)
z: out - out - out - out (translational)
A_0 = A_1 = A_2 = A_3 = 1

n=1
x: left - left(<) - right(<) - right  (vibrational, stretch)
y: down - down(<) - up(<) - up  (rotational, z-axis)
z: in - in(<) - out(<) - out  (rotational, y-axis)
A_0 = cos(pi/8);
A_1 = cos(3pi/8) < A_0;
A_2 = cos(5pi/8) = -A_1;
A_3 = cos(7pi/8) = -A_0

n=2
x: left - right - right - left  (vibrational, stretch)
y: down - up - up - down  (vibrational, angle bending)
z: in - out - out - in  (vibrational, angle bending)
A_0 = cos(pi/4);
A_1 = cos(3pi/4) = -A_0;
A_2 = cos(5pi/4) = -A_0;
A_3 = cos(7pi/4) = A_0

n=3
x: left(<) - right - left - right(<)  (vibrational, stretch)
y: down(<) - up - down - up(<)  (vibrational, angle bending)
z: in(<) - out - in - out(<)  (vibrational, angle bending)
A_0 = cos(3pi/8)>0;
A_1 = cos(9pi/8)<0 (|A_1|>|A_0|);
A_2 = cos(15pi/8) = -A_1;
A_3 = cos(21pi/8) = -A_0


Planar molecules (x-y plane)

3 atoms; isosceles triangle; equal mass "base" atoms (e.g., H2O)
3 translational, 3 rotational, 3 vibrational

Organized in same manner as N=3 collinear...

right         right     (translation)
       right
  up            up      (translation)
         up
 out           out      (translation)
        out

  up-left         up-right   (vibrational, stretch/sym)
            down
 down-left        up-left    (rotational, z-axis)
           right
    in              out      (rotational, y-axis)
            zero

 down-right         up-right  (vibrational, stretch/asym)
             left
  up-right          up-left   (vibrational, angle bending)
             down
    out              out      (rotational, x-axis)
              in

Angle bending can be modelled properly by a spring attaching
the two "base" atoms.

4 atoms, square: 3 translational, 3 rotational, 6 vibrational

This is a somewhat "floppy" molecule -- I don't know of
any actual examples of this shape, but it is a good case
study to understand normal modes.

3x4 organization hard to justify...

 right  right   (translation)
 right  right

  up     up     (translation)
  up     up

  out    out    (translation)
  out    out

  up-right  down-right   (rotational, z-axis)
  up-left   down-left

    out       out        (rotational, x-axis)
     in        in

    out        in        (rotational, y-axis)
    out        in

  down-left    down-right  (vibrational, rectangular)
   up-left      up-right

    down          up       (vibrational, trapezoidal)
     up          down

    left         right     (vibrational, trapezoidal)
   right          left

  up-left     up-right     (vibrational, symmetric stretch)
 down-left   down-right

 down-right    up-right    (vibrational, angle bending, rhombic)
 down-left     up-left

   out           in        (vibrational, dihedral angle bending)
    in          out

Molecule can be modelled mostly with 4 adjacent bonding springs,
and two cross-springs to represent angle-bonding.

The dihedral angle bonding is zero frequency in this model --
proper modelling of this vibrational mode requires a torsional
spring model for the dihedral angle.

Note: A dihedral angle is an angle between planes.


3-D molecules:

4 atoms tetrahedral (e.g., NH3): 3 translational, 3 rotational, 6 vibrational

I did not do this calculation myself.

Simulations for the vibrational modes can be found by typing
"vibrational modes for NH3" into a search engine.

I found this resource...
  https://www.chem.purdue.edu/jmol/vibs/nh3.html

======================================================================

**** Large systems 1-D

Consider a 1-D lattice consisting of a large number of equivalent
atoms, all equally spaced with equivalent bonding (i.e., equal masses,
equal spring constants).

We consider only motion along the x axis.


Define the following:
  a = spacing between adjacent atoms
  k = spring constant of bonding spring between adjacent atoms (all equal)
  m = mass of atoms (all equal)
  N = number of atoms in 1-D array
  L = Na = length of array of atoms

Equilibrium positions of each atom:
  x_i = (i+1/2)a; i = 0, 1, ..., N-1
The "1/2" makes the math work out better).

[diagram]

The array lies within the interval [0,L] along the x-axis.  This region
is subdivided into N equal "zones", with each atom (at equilibrium)
occuping the center of its zone.

The motion of the atoms is given by:
  x_i(t) = (i+1/2)a + s_i(t)
  s_i(t) = displacement (at time t) of atom i from its equilibrium position

If s_i+1 = s_i, then the "spring" between i and i+1 is neither
stretched nor compressed (x_i+1 - x_i = a, equilibrium length).

Net spring stretch = s_i+1 - s_i
  compressed if < 0
  stretched if > 0
  relaxed if = 0

Newton's 2nd Law applied to atom i:
  m d^2s_i/dt^2 = -k(s_i - s_i-1) + k(s_i+1 - s_i)


Normal modes can be found by requiring all atoms to move
with a single frequency:
  s_i(t) = A_i cos(omega*t)

Plugging this into Newton's 2nd Law above leads to an algebraic
relationship between s_i, s_i-1, and s_i+1.

Careful treatment of this "recurrance relationship", along with
careful treatment of the boundary yields N solutions (one for each
normal mode).

  s_i(t) = A_i cos(omega*t)
  A_i = A cos[(n/N)*pi*(i+1/2)]
  A = overall amplitude factor
  n = 0, 1, ..., N-1  (mode number)
  omega_n = 2*sqrt(k/m)*sin(n*pi/2N)

n=0 corresponds to the translational mode (A_i = A for all
atoms).

n != 0 are the vibrational modes.


Substitute
  x = (i+1/2)a = (i+1/2)L/N   (equilibrium position of atom i)
gives
  A(x) = A cos([n*pi/L]*x)
  s(x,t) = A cos([n*pi/L]*x) cos(omega_n*t)
  omega_n = 2*sqrt(k/m)*sin(n*pi/2N)

The cosine argument for A(x) varies between 0 and n*pi, which
corresponds to n half-cycles.


These are standing wave solutions.  Standing waves will be covered
later.

If you have already studied them (say, standing waves in a string), these
diagrams may look familiar to you:

[diagram: PHYC 2A ch 17 diagram showing standing waves in a string]

Because the string is fixed at the end, there are nodes at both ends,
and this results in an integer number of half-cycles (loops).

Since the 1-D atom array we are considering now has "free ends", it turns
out there are anti-nodes at the end.  The waves are otherwise similar.


[diagrams for each mode:
  s(x,t) plot (fixed times, t=0 (solid) and t=T/2 (dotted))
  diagram below shows array divided into expansion and contraction regions
  with arrows indicating array movement; at both t=0 and t=T/2]

n = 0 (translational mode)

n = 1 (fundamental mode)
d_1 = L  ...length of expansion / contraction region
This is a large-scale vibration mode

Note: negative slope --> compression; positive slope --> expansion

n = 2 (second mode)
d_2 = L/2

General/large n
d_n = lambda_n/2 = L/n
These represent small-scale vibrations


The angular frequency of each mode...
  omega_n = 2*sqrt(k/m)*sin(n*pi/2N)
Note that larger scale vibrational modes are lower frequency.

Esimate for H2 bond from PE graph:
  sqrt(k/m) =approx= 6x10^14 rad/s

Other atoms are more massive, so sqrt(k/m) likely to be somewhat less.

For example, m_O = 16 m_H, and so sqrt(k/m) likely reduced by factor
of 4 if k is similar.

n=0
--> omega_n=0  (zero frequency -- no restoring force)

n << N
--> omega_n =approx= 2*sqrt(k/m)*n*pi/2N = (n*pi/N)*sqrt(k/m)
             << sqrt(k/m)
Very broad motions (none of the bonds are compressed / extended very
much, in spite of large overall motions of atoms at the anti-nodes).
This leads to low frequencies.

Fundamental (n=1) for L=1m, a=10^-10m, N=10^10
--> omega_1 = sqrt(k/m)*(n/N)*pi = 1.9x10^5 rad/s
--> f = omega/2pi = 3x10^4 Hz
For iron (Fe-57), f reduced by around 8 --> f = 4000 Hz about right...

Smallest scale (n = N-1)
--> omega_n = 2*sqrt(k/m)*sin((N-1)pi/2N)
            =approx= 2*sqrt(k/m)*sin(pi/2)
            = 2*sqrt(k/m)
Virtually every atom is moving in a different direction.

Bonds are stretched/compressed a lot -- very short range vibrations.
Leads to very high frequencies (10^14 Hz).

======================================================================

**** Large systems 3-D

Lets begin with the 1-D array again, but this time allow motions in
3-D.

We already discussed the "longitudinal modes", where the atoms
displace along the same axis as the array of atoms.

In addition to those modes are the "transverse modes", where the
atoms displace at right angles to the array of atoms.  The atoms
are lined up on the x-axis, and they move in the y or z direction.

As it turns out, for each direction (y and z), there are again N
modes, characterized by a mode number n varying from 0 to N-1.

Qualitatively, the modes are similar in character to the longitudinal
modes [show s vs x graphs from 1-D].

In the transverse case, the motion of the atoms actually resembles the
graph, and looks like a wave in a string.

The length scale associated with mode n should be given by
  d_n = L/n

The n=0 transverse modes are translational.

The n=1 transverse modes are rotational.

The n>1 transverse modes involve angle bending (see 3-atom, 4-atom
linear molecules).

Does the formula
  s(x,t) = A cos([n*pi/L]*x) cos(omega_n*t)
apply in the transverse case?

Certainly not for n=1 -- rotational mode requires a linear s vs x
relationship.  Not clear how it generalizes for n>1.

Does the formula
  omega_n = 2*sqrt(k/m)*sin(n*pi/2N)
apply in the transverse case?

Probably not in detail, but it is not likely to be far off.

Most important adjustment may be that the "k" may be smaller (bond
angle bending is usually more pliable than bond stretching), so the
frequencies may be smaller for transverse modes compared to
longitudinal modes.


Now consider the full 3-D situation -- the atoms themselves occupy a
rectangular L_x x L_y x L_z lattice:
  L_x = N_x a
  L_y = N_y a
  L_z = N_z a

There are N = N_x N_y N_z atoms total, and thus 3N modes, of which
3 are translational, 3 are rotational, and the remaining 3N-6 are
vibrational.

(Note: this assumes a cubic lattice -- that may or may not be correct
in a particular situation, but we are looking for general trends, and
so such details turn out to not be important.)

Bonds can be modelled using nearest neighbor springs, including
diagonal springs to simulate bond-angle constraints.

Longitudinal vibrational modes involve alternating regions of
compression and expansion (just as in 1-D), and are most conveniently
described by giving pressure as a function of position and time.

Each mode is described by a triple of integer mode numbers (nx, ny, nz):
  nx = 1, 2, ..., N_x-1
  ny = 1, 2, ..., N_y-1
  nz = 1, 2, ..., N_z-1
  P(r,t) = P0 + A sin([nx*pi/L_x]*x) sin([ny*pi/L_y]*y) sin([nz*pi/L_z]*z) cos(omega*t)

[diagram: 2-D example (nx,ny)=(2,3)
  show nodal lines
  alternating regions of + and -]

+ indicates region of increased pressure (compression)
- indicates region of decreased pressure (expansion)
This picture is for t=0.
For t=T/2, all signs reverse.
For t=T, all signs reverse back again.

Pressure is related to forces, which are zero at the edges (hence sin's
instead of cos's).

Boundaries represent nodal lines (planes in 3-D).  There are
nodal lines (planes) in the middle as well for nx,ny[,nz]>1.

Characteristic distance given by
  d_(nx,ny,nz) = sqrt[(nx/L_x)^2 + (ny/L_y)^2 + (nz/L_z)^2]^-1
which reduces to
  d_(nx,ny,nz) = L/sqrt(nx^2 + ny^2 + nz^2)
           =crude= L/max(nx,ny,nz)
if L_x = L_y = L_z = L.

Note that characteristic distance is short if ANY of nx, ny, or nz
is large.


We can also look at the full 3-D case in terms of the motions of
individual atoms (this takes care of the longitudinal and transverse
modes).

Equilibrium positions:
  r_ijk = (i+1/2)*a x-hat + (j+1/2)*a y-hat + (k+1/2)*a z-hat

Motions:
  r_ijk(t) = [equil] + s_ijk(t)
  s_ijk(t) = displacement from equilibrium

s can also be expressed as a function of r_ijk(equil) instead of (ijk):
  s(r,t) = displacement from equilibrium of atom whose equil position is r

Modes (including translational, rotational, longitudinal vibrational,
and transverse vibrational) can be expressed in terms of these
displacements, but it is less clear how to do this.

[Refer back to square molecule example]

Only one vibrational mode is longitudinal (symmetric stretch), and
involves compression/expansion.  (nx,ny) = (1,1).  Not surprisingly,
it turns out to be the highest frequency mode.

The other (2-D) vibrational modes involve shape changes (rectangular,
trapezoidal, rhombic), but no changes in area.

Based on results for 1-D array...
For purposes of mode counting and distribution only (for large
systems), it may make sense to associate each mode with integers
(nx,ny,nz), such that
  d_(nx,ny,nz) = sqrt[(nx/L_x)^2 + (ny/L_y)^2 + (nz/L_z)^2]^-1
Each set of integers corresponds to 3 modes, 1 longitudinal, and
2 transverse (roughly speaking).

For large systems, the fact that some of the modes are translational
and rotational, and questions about nx, ny, or nz equalling zero are
not important factors.

What about frequencies of modes?  How does the following relation
(1-D longitudinal) generalize?
  omega_n = 2*sqrt(k/m)*sin(n*pi/2N)

Accurate generalization may be very difficult, but a crude approximation
may be to first rewrite the 1-D longitudinal mode frequency in terms
of d_n as follows:
  d_n = L/n  --> n = L/d_n
  L = Na     --> N = L/a
  n/N = (L/d_n) / (L/a) = a/d_n
  omega = 2*sqrt(k/m)*sin(a*pi/d_n)

We can then approximate the 3-D general case by using the same
formula
  omega = 2*sqrt(k/m)*sin(a*pi/d_(nx,ny,nz))

Refinements may include using a different "k" for longitudinal (based
on bond stretching) and transverse (based on bond angle bending).

======================================================================

**** Some thoughts on KE partition theorem

I wanted to end our discussion of normal modes with some thoughts on
the energy partition theorem -- the idea that the kinetic and
potential energies of a system of (bonded) atoms can be expressed as a
sum of contributions from each individual normal mode.

I also want to take this opportunity to talk about the general
motion of a system of atoms connected by "spring" bonds, and what
we are really counting when we count (normal) modes.

One thing I hope to impress you with is that the energy partition
theorem, while indeed true (as long as the bond forces can be
approximated as (linear) spring forces), is by no means OBVIOUS.

We already got an inkling of that when we showed that
  K_sys = K_cm + K_rel
which separates out the center-of-mass motion from the rest of the
modes, but doesn't try to separate individual rotational and
vibrational modes from each other.

[Show derivation from "System kinetic energy" in cons-Esys.txt]

In that case, the partition theorem ended up being a consequence of
the vanishing of a "cross term", which results when a velocity, one
that is made up of two contributing components, is squared.

It turns out that the general case is proved in much the same way.


To help illustrate what is happening, recall what we learned about
the 1-D array of atoms (equal masses, spring constants, and spacing).

We started by writing the positions of each atom in terms of
displacements from equilibrium:
  x_i(t) = (i+1/2)a + s_i(t)    i=0, 1, ..., N-1

Imposing the condition that the system oscillates with a single
frequency results in the normal modes...
  s_i(t) = A_i cos(omega t)
  A_i = A cos[(n/N)*pi*(i+1/2)]
  omega = 2*sqrt(k/m)*sin(n*pi/2N)

Thus, one possible motion of the system involves mode n:
  s_i(t) = A_i cos(omega t)
  A_i = A cos[(n/N)*pi*(i+1/2)]

Another possible motion involves a different mode n':
  s_i(t) = A'_i cos(omega' t)
  A'_i = A' cos[(n'/N)*pi*(i+1/2)]

There are a total of N different modes?  Does this mean that
there are only N allowed motions?

Actually no.  The most general motion involves a general (linear)
combination of these motions.  When we say there are N modes, what we
are counting are the INDEPENDENT motions; the number of DEGREES OF
FREEDOM of the system; or the number of independent COORDINATES that
can be used to describe the motion.

Consider a combination of these two modes, n and n':
  s_i(t) = A_i cos(omega t) + A'_i cos(omega' t)
  A_i = A cos[(n/N)*pi*(i+1/2)]
  A'_i = A' cos[(n'/N)*pi*(i+1/2)]

This turns out to be another valid motion of the system.

A and A' can be chosen arbitrarily and independently.

So, given this combined motion, what is the kinetic energy of our
system?

First, the velocity of each atom:
  v_i(t) = ds_i/dt
    = -[A_i omega sin(omega t) + A'_i omega' sin(omega' t)]

The kinetic energy can now be computed:
  K_sys = sum_i (1/2)m v_i^2
    = sum_i (1/2) m [A_i omega sin(omega t) + A'_i omega' sin(omega' t)]^2
    = (1/2) m [
          omega^2 sin(omega t)^2 sum_i A_i^2   <--- KE for mode n alone (if A'=0)
        + omega'^2 sin(omega' t)^2 sum_i A'_i^2  <--- KE for mode n' alone (if A=0)
        + 2 omega omega' sin(omega t) sin(omega' t) sum_i A_i A'_i   <-- cross term
      ]
    = K_n + K_n' + cross-term

The first two terms are the kinetic energy that system would have if
only one of the modes were active.

For example, K_n would be the kinetic energy of the system if we shut off
the n' mode (i.e., set A'=0 and recalculate K_sys).  Similar for K_n'.

The energy partition theorem states that
  K_sys = K_n + K_n'
with NO cross term.

For arbitrary motions (i.e., arbitrary choices of component motions),
there is no guarantee that the cross term will vanish.

But when the component motions are the single-frequency "normal modes"
calculated for a system subject to linear "spring-like" forces, it
turns out that the cross term ALWAYS vanishes.

The general proof can be traced back to the "orthogonality of
eigenvectors of symmetric matrices", an important result from linear
algebra.


In the case at hand, the vanishing cross term is
  2 omega omega' sin(omega t) sin(omega' t) {sum_i A_i A'_i}
which vanishes because (sum_i A_i A'_i) is equal to zero (as long
as n != n').

To see this is reasonable, recall that A_i for a given mode can
be rewritten as a function of equilibrium position (instead of i):
  A_i = A(x) = A cos([n*pi/L]*x)     [x = x_i^eq = (i+1/2)a]
  A'_i = A'(x) = A' cos([n'*pi/L]*x)

[show graphs for A(x) and A'(x) -- show x_i locations with
tick marks]

Since n and n' are different, the two standing waves have different
"wavelengths", and move in and out of phase with each other.

When the graphs align (+ to +, - to -), A_i A'_i tends to be positive.

When the graphs anti-align (+ to -, - to +), A_i A'_i tends to be negative.

Since the graphs spend roughly equal parts of their domain in and out
of phase with each other, it is not unreasonable that
  sum_i A_i A'_i = 0
Note that if n = n', this breaks down, as the A(x) and A'(x) graphs
are identical, and will remain perfectly aligned throughout.

This result can be shown mathematically, and is a very useful result
from wave calculations.


Here is the mathematical proof that (sum_i A_i A'_i) = 0.  I will not
go over it in detail (it uses complex numbers to help with trig
identities, a technique we will learn about later).

Trig identities can be used to show that
  (cos A)(cos B) = (1/2)[cos(A+B) + cos(A-B)]

Thus,
  A_j = A cos[(n/N)*pi*(j+1/2)]
  A'_j = A' cos[(n'/N)*pi*(j+1/2)]
  A_j A'_j = AA' cos[(n/N)*pi*(j+1/2)] cos[(n'/N)*pi*(j+1/2)]
           = (AA'/2)[cos(((n+n')/N)*pi*(j+1/2)) + cos(((n-n')/N)*pi*(j+1/2))]

Let m=n+n' or n-n' be an integer that is not zero (since n,n' >= 0 and n != n').
Then
  sum_j cos[(m/N)*pi*(j+1/2)]
    = Re sum_j exp[i(m/N)*pi*(j+1/2)]
    = Re sum_j exp[i(m/N)*pi]^(j+1/2)
    = Re r^1/2 sum_j r^j      ... r = exp[i(m/N)*pi]
    = Re r^1/2 [r^N - 1]/[r - 1]
    = Re [r^N - 1]/[r^1/2 - r^-1/2]

  r^N = exp[i(m/N)*pi]^N = exp[i*m*pi] = (-1)^m = +1 or -1
  r^1/2 - r^-1/2
      = exp[i(m/N)*pi/2] - exp[-i(m/N)*pi/2]
      = 2i sin[(m/N)*pi/2]

  sum_j cos[(m/N)*pi*(j+1/2)]
    = Re [(-1)^m - 1] / [2i sin[(m/N)*pi/2]]
    = Re (real)/(pure imaginary and not zero)
    = 0

It follows that
  sum_j A_j A'_j = 0


Again, the energy partition theorem result can be generalized to more
than two modes (each cross term only involves two modes at a time),
and to more general vibrational mode situations (2-D, 3-D,
non-identical masses/spring constants, arbitrary geometry).

One can similarly show that U_sys also partitions in the same way.

The point of all this is not to memorize the details of the
calculation that was just done, but to understand the significance,
and NON-TRIVIALITY of the partition theorems -- that these cross terms
cancel, and that should NOT BE OBVIOUS.

And yet, it is true.

======================================================================

*** Non-mechanical energy

**** Thermal energy

We are already familiar with the fact that friction, a non-conservative
force, tends to convert mechanical energy into (mostly) thermal energy,
a non-mechanical form of energy.  Friction causes objects to heat up.

We are also familiar with the idea that thermal energy is simply
vibrational energy at the atomic scale.  As things heat up, the atoms
that make them up vibrate faster.

By contrast, translational and rotational kinetic energy are definitely
forms of mechanical (human scale) energy.

When we began our study of energy (object view), we originally defined
kinetic energy in terms of the object's overall motion (i.e., its
translational kinetic energy).  We made this definition based on the
principle that the object should be treated as a simple particle.

It is therefore tempting to define the mechanical energy of an
object to be the translational + rotational parts of its kinetic
energy (along with any PE associated with forces EXTERNAL to the
object), and include the energy associated with all vibrational modes
as thermal (non-mechanical) energy.

This, however, is over-simplistic.

Large-scale vibrational modes are clearly visible, and should also be
classified as mechanical energy.

[demo: 450.m4v]

The fundamental mode of the oscillating spring is quite visible.
Treating the spring oscillation energy as "non-mechanical" would be
very confusing.

Even for objects that are not as pliable as a spring, these
large-scale vibrational modes constitute sound waves, which are a form
of mechanical energy (at least for the time being -- before they
"dissipate").

So this leads to the following question: which vibrational modes
should be considered "mechanical" (i.e., the energy associated with
those modes count as mechanical energy), and which vibrational modes
should be considered "thermal" (or "non-mechanical")?

What is the cutoff?


To help answer this question, it pays to examine how energy
typically distributes among the different modes.

(1) Translational, rotational, and a few large-scale vibrational
modes will often carry significant energy, significantly more than
any one typical small-scale vibrational mode.

(2) Energy tends to distribute rather evenly among the shorter
range vibrational modes.

It is very unlikely that, for example, the n=983,412 (and no other mode)
will contain significant energy.  It is very difficult to excite a
specific high-n (short range) vibrational mode.

In view of these trends, we should aim to place the mechanical
energy cutoff scale such that...

(1) There are NO modes that should be classified as mechanical that
are instead classified as non-mechanical.  Otherwise, it may be
possible to miss a significant amount of mechanical energy.

(2) There may be some modes that really should be classified as
non-mechanical, but are instead classified as mechanical, as long as
there aren't "too many" of them.  Most modes that should be classified
as non-mechanical should be classified as such.

It therefore makes sense to err on the "small side" when considering
the mechanical energy cutoff.


Fortunately, there is rather large gap between "human scale" and
"atomic scale", so we have some space to play with.

I will propose that we set the cutoff
  d_mech = 1 um = 10^-6 m

Condition (1) is satisfied -- surely any vibrational mode with a
length scale below 1um will be "invisible" and should be considered
non-mechanical (thermal).

To see if (2) is satisfied, let's do some mode analysis.

First consider a 1-D uniform array of atoms:
  L = 1m
  a = 10^-10m
-->
  N = L/a = 10^10
There are 10^10 total modes.

A mode, n, is considered mechanical if
  d_n = L/n >= d_mech
-->
  n <= L/d_mech = 10^6
There are 10^6 mechanical modes

The ratio (# mech modes)/(# modes) is given by
  ratio = 10^6 / 10^10 = 10^-4
ALT:
  ratio = (L/d_mech) / (L/a) = a/d_mech = 10^-4  (independent of L)

Only one mode out of every 10000 is mechanical.

The vast majority of vibrational modes are non-mechanical.
(2) is surely satisfied.

Recall that for 3-D (longitudinal modes anyway), that
  d_(nx,ny,nz) = L/sqrt(nx^2+ny^2+nz^2) =approx= L/max(nx,ny,nz)
In order for a mode to be considered mechanical, we need
  nx, ny, AND nz <= L/d_mech

Thus, the ratio of mechanical / total modes is given (approximately)
by
  ratio =approx= (L/d_mech)^3 / (L/a)^3 = (a/d_mech)^3 = 10^-12

It is even more clear that the vast, vast majority of vibrational
modes are non-mechanical.


In the next video(s), we will now consider the distribution of
energy in these mechanical/non-mechanical modes...

======================================================================

**** Equipartition theorem

The study of thermodynamics and statistical mechanics flourished in
the 1800's, and ultimately culminated in a very deep understanding of
how energy distributes in an object that consists of a large number
of elementary particles.

One of the most important results from this study was the
"Equipartition Theorem", which relates to how energy is distributed
among the normal modes of a large system of atoms.

The Equipartition Theorem is very simple:

When a system is in thermodynamic equilibrium at temperature T (very
important assumption!), the kinetic energy associated with each normal
mode (on average) is given by

  K per mode = (1/2)kT
where
  k = 1.38 x 10^-23 J/K   (Boltzmann's constant; not a spring constant)
  T = temperature of the system in an absolute temperature scale (e.g., K)

Furthermore, the potential energy associated with each vibrational
mode (on average) is also given by
  U per vibrational mode = (1/2)kT

The term "equipartition" means "equal division" -- when the system is
in thermal equilibrium, ALL modes contribute the same KE and PE.

Boltzmann's constant is an important scale factor in thermodynamics,
as it connects temperature units to energy units.
  T=300K (room temperature)  --->  kT = 4.14 x 10^-21 J = (1/40)eV

Boltzmann's constant also appears in the Ideal Gas Law, which you may
already be familiar with:
  PV = NkT  (physics version -- N = # molecules)
  PV = nRT  (chemistry version -- n = # moles, R = (N_A)k = gas constant)

Note that each vibrational mode carries both kinetic energy and
potential energy, for a total energy per mode of kT.

Note also that the amount of energy in each mode increases with
increasing temperature -- consistent with the fact that atoms vibrate
faster when the temperature goes up.

Note also that the mass of the atoms is not factored in here -- this
implies that more massive atoms will vibrate more slowly at a given
temperature (same energy --> less speed).


Example: Consider a 1cm x 1cm x 1cm cube of bonded atoms (a=10^-10 m)
  # atoms = [(10^-2 m) / (10^-10 m)]^3 = 10^24  (about 1.7 mole)
  # modes = 3 x 10^24

Since all but 6 of these modes are vibrational, the average energy
per mode is kT, so that
  total energy = (3x10^24)(kT)
     = (3x10^24)(4.14 x 10^-21 J)    [at T=300K]
     = 12.42 kJ
That is a lot of energy!!!

How much of this energy is mechanical?

We can estimate the number of mechanical modes by
  # modes = 3(L / d_mech)^3 = 3(10^-2 m / 10^-6 m)^3 = 3x10^12
Each of these modes (except 6) carries the same kT energy, so
  E_mech = (3x10^12)(4.14 x 10^-21 J)
         = 12.42 nJ   [at T=300K]

Are all of these modes really mechanical?  One can argue about exactly
where we put the cutoff (d_mech = 1um), but you can see that it makes
little difference -- mechanical energy is negligible anyway.

Virtually all of the 12.42kJ is thermal!

How much of this energy is overall translational?

No matter how big the system is, there are only 3 translational modes,
each with (1/2)kT energy (note: no PE contribution), so
  K_cm = (3/2)kT = 6.21 x 10^-21 J = 0.039 eV
At thermodynamic equilibrium, objects do carry a small amount of KE,
but small indeed!!  There is no perceptual motion of this object.

By the way, this implies that a system in thermodynamic equilibrium
can not have significant large-scale relative motions between
different parts of the system (unless those parts are truly
non-interacting).  Such a system would violate equipartition.

(This, by the way, explains why friction is so ubiquitous -- we will
explore this later.)


The Equipartition Theorem allows us to predict a lot of things at
human scale.  For example, if we increase the temperature from 300K
to 301K, how much energy do we need to add to the 1cmx1cmx1cm object?

Energy at 300K = (3x10^24)(k*300K) = 12.42 kJ
Energy at 301K = (3x10^24)(k*301K) = 12.4614 kJ
Difference = (3x10^24)(k*1K) = 41.4J

According to this calculation, such an object would have a heat
capacity of 41.4 J/K.

This is consistent with the observation that most solids at room
temperature have a heat capacity of 3k per molecule.

It is also clear from this calculation that such a heat capacity does
not depend on the actual temperature (it takes the same 41.4J
additional energy to increase T from 100K to 101K).

This latter statement does NOT agree with experiment!!!!!!!!

Such abherrant behavior of actual systems led scientists to realize
that something unusual (non-classical) is going on at the atomic
scale.

It makes sense that non-classical physics at the atomic scale would
first reveal itself during a study of the impacts of atomic physics on
human scale phenomena.  Such is the nature of thermodynamics.

======================================================================

**** Quantum corrections

The Equipartion Theorem is based on classical physics, and in
particular, is based on the idea that an oscillating system (such as a
simple spring-mass system) can oscillate with any amplitude and
therefore can carry any amount of energy.

According to the rules of Quantum Mechanics, a system that can
oscillate with frequency f (angular frequency omega) can only
change energies by amounts that are integer multiples of
  dE = hf = hbar*omega
  hbar = h/2pi = 1.055 x 10^-34 J s  (reduced Planck's constant)

According to the Equipartition Theorem, the amount of energy
(on average) that should be associated with a given mode is kT.

If kT >> hbar*omega, the quantization condition imposed by QM is not
very restrictive, and the Equpartition Theorem results are pretty much
valid for the given mode.  This holds up at high temperature and/or
low frequency (long-range vibrational modes).

If kT << hbar*omega, the quantization condition in QM is
extremely restrictive.  In such cases, the true average energy of
that mode is much less than kT.


It can be shown that, in general, the average energy for a given
vibrational mode is given by
  E = kT Phi(hbar*omega/kT)
where Phi() represents the QM correction factor given by
  Phi(gamma) = gamma / (e^gamma - 1)

[show a graph of Phi(gamma)]

  gamma = hbar*omega / kT = omega/omega_cutoff
  omega_cutoff = kT / hbar
               = 3.93 x 10^13 rad/s at T=300K

  gamma << 1  --> Phi(gamma) -> 1    (mode is fully active)
  gamma = 0.5 --> Phi(gamma) = 0.77  (mode is mostly active)
  gamma = 1   --> Phi(gamma) = 0.58  (mode is partially frozen)
  gamma = 2   --> Phi(gamma) = 0.31  (mode is significantly frozen)
  gamma >> 1  --> Phi(gamma) -> 0    (mode is completely frozen)


Example: H2 bond
  k_eff = 572 N/m
  m = 1.67 x 10^-27 kg
  mu = m/2  (reduced mass for the two-atom system)
-->
  omega = sqrt(k/mu) = 8.28 x 10^14 rad/s

At room temperaturn, omega_cutoff = 3.93 x 10^13 rad/s, so
  gamma = omega / omega_cutoff = 21.1
  Phi(gamma) = 1.5 x 10^-8   (almost completely frozen)


For most molecular gases at room temperature, the vibrational
frequencies are simply too high, and the vibrational modes freeze out.

By contrast, the vibrational frequencies of most solids at room
temperature are low enough that most of the vibrational modes are
active.

(You wouldn't necessarily get that result from the formulas we
have been introducing: I tried and it didn't work)

This results in a total energy for an N atom solid of
  E = (3N)(kT) = 3NkT
which is consistent with our calculations in the previous video.

However, at lower temperatures, many of the shorter range vibrational
modes may freeze out, thus reducing the overall energy from what
we calculated with the classical Equipartition Theorem.

It is a well-established fact that heat capacities decrease as
temperature decreases and more modes freeze out.

In fact, as T->0K, ALL modes freeze out (even translational and
rotational modes!), and the heat capacities will drop to zero.

======================================================================

**** A failed calculation

I attempted the following calculation to try to assess what impact,
if any, that the QM correction to the Equipartition Theorem would
have on our previous calculation of thermal energy for a simple
1cm x 1cm x 1cm cube at room temperature.

While the numerical results below turn out to be incorrect (and there
are many approximations that occurred, which could explain the
discrepancy), some of the ideas below may be relevant in a real-world
calculation of these impacts at lower temperature.


First, we start with a 1-D uniform array...

For given mode n,
  omega_n = 2*sqrt(k/m)*sin(n*pi/2N) <= omega_cutoff
-->
  sin(n*pi/2N) < omega_cutoff/(2*sqrt(k/m)) = 0.0336  (T=300K, H2 bond parameters)
  n*pi/2N < 0.0336
  n/N < 0.0214
  d_freeze/a = [L/n_freeze]/a = N/n_freeze = 1/0.0214 = 46.8
-->
  d_freeze =approx= 4.68 nm  (if a=10^-10m)

n_freeze represents the mode whose frequency is equal to the cutoff
frequency.  Modes higher than n_freeze will have a higher frequency
and tend to be frozen out.  Modes lower than n_freeze will have a lower
frequency and tend to be more active.

Are ALL modes higher/lower than n_freeze completely frozen/active?
No, but it is a good first estimate.

So we will assume that only those modes for which d_n > d_freeze are
active (and that they are fully active).

We can now get mode counts:
  # modes = L/a
  # active modes = L/d_freeze
  # mechanical modes = L/d_mech
  (# mech modes) / (# active modes) = (L/d_mech) / (L/d_freeze) = d_freeze/d_mech

In 3-D, the analysis is more complicated, but a good first
approximation is to simply cube everything (and multiply by 3 to
account for transverse modes):
  # modes = 3(L/a)^3
  # active modes = 3(L/d_freeze)^3   [assuming same omega vs d as 1-D]
  # mech modes = 3(L/d_mech)^3

Let's revisit the total energy of the 1cm x 1cm x 1cm box:
  # modes = 3(10^-2 m / 10^-10 m)^3 = 3 x 10^24
  # active modes = 3(10^-2 m / 4.68 x 10^-9 m)^3 = 2.93 x 10^19
  total energy = (2.93 x 10^19)(4.14 x 10^-21 J) = 0.121 J   [much smaller -- suspiciously low]
  # mech modes = 3(10^-2 m / 10^-6 m)^3 = 3 x 10^12   [no change]
  mechanical energy = (3 x 10^12)(4.14 x 10^-21 J) = 12.42 nJ

Note that the total energy is MUCH smaller now (most of the smaller
scale modes are frozen out).

E_mech is still insignificant, justifying our choice of cutoff.


At this point, I will assess the above calculation.

Given the reality that most of the vibrational modes are active at
room temperature for most solids, it would seem that our estimates for
omega_n are quite a bit off (by at least a factor of 10).

A big part of that is using the H2 bond for k_eff and m.  Most
atoms have much higher masses.

It is also possible that k_eff is overestimated, especially for the
transverse modes.

Update: I have no verification of this, but one online lecture
calculatd k_eff for copper based on the stiffness of a copper rod
(measured experimentally), and came up with k =approx= 30 N/m for a
single bond.

k_eff was calculated for H2 based on bond length oscillations.
Transverse modes are more likely to involve bond angle oscillations,
which I would expect to have a lower stiffness.

It is also possible that the generalization to 3-D is more complicated
than I am accounting for.


The highest frequency mode is n=N-1, which (for 1-D) leads to
  omega_max = 2*sqrt(k/m) = 1.17 x 10^15 rad/s = 29.8 omega_cutoff

For iron (m=57 m_H), omega would be reduced by around a factor of 8.

We need another factor of 3 or 4 from k_eff and 3-D to force
  omega_max < omega_cutoff
Another factor of 2 or 3 would lead to
  omega_max << omega_cutoff
to enough of a degree that we can safely ignore QM corrections.

======================================================================

**** Chemical energy

Thermal energy is based on the vibrational modes of a system of atoms.
These vibrations involve stretching, compressing, and/or bending of
existing chemical bonds.  The bonds themselves don't change.

By contrast, changes in chemical energy are related to breaking some
bonds and/or forming new ones.


In general, energy is released when bonds are formed.

[see "Modelling atomic bonds" -- H2 U_eff plot]

For example, when two hydrogen atoms are combined into an H2 molecule,
there is a decrease in U_eff of 4.5 eV (binding energy), which can be
transferred to some other form (usually thermal energy).

(Yes, the graph says "4.8eV", but there is about 0.3eV of "zero-point"
energy -- it is related to the proper QM treatment of the bond
oscillation.)

(Also, keep in mind when calculating the thermal energy of a system
that includes these bonds, the PE of the "bond spring" is measured
relative to that baseline energy level; so a chemical reaction can be
throught of as "changing the reference configuration".)

By contrast, the act of breaking an H2 molecule into two hydrogen
atoms will result in an increase of U_eff (from -4.5eV to zero).
That energy must be supplied from somewhere.

In general, "associative reactions" (bond formation) release energy,
and are considered exothermic.  These reactions often occur
spontaneously.

By contrast, "dissociative reactions" (bond breaking) generally
require an input of energy, and are considered endothermic.  These
reactions must be induced by some process that supplies energy
to the system (e.g., electrolysis).

Combination reactions (some bonds break, others form) are exo- or
endo-thermic, depending on the net energy balance of all of the
bonds that are formed or broken.

Different atomic bonds will have different binding energies, so the
net effect of changing the bonds can be quite complicated.


An energetic study of chemical reactions can be quite tricky, and
requires a good working knowledge of thermodynamics to fully
appreciate.

The reason for this is that there will almost certainly be additional
consequences, aside from bond formation/breakage, when chemical
reactions are allowed to proceed.

For example, consider the dissociation of water into hydrogen and
oxygen:
  1 H2O -> 1 H2 + (1/2) O2   [dH = +285.83 kJ/mol -- endothermic]

At room temperature, water is a liquid and occupies relatively little
volume.

However, at the same temperature, both H2 and O2 are gases, and occupy
relatively large volumes.

If this chemical reaction occurs at constant pressure (i.e., exposed
to the atmosphere, which is typical), then the volume of the system
MUST increase, and since its environment is applying pressure to the
system, the system must push back with the same pressure and does
positive work on the environment -- this represents energy transferred
OUT of the system.

A calculation involving the Ideal Gas Law shows that the work done by
the system on its surroundings (T=298K, P=1atm) is 3715 J for every
mole of H2O that is converted.

The "heat of reaction" or "change in enthalpy", dH = +285.83 kJ/mol,
takes this work into account.

Basically, for every mole of H2O that is to be dissociated,
- 285.83 kJ of energy must be input into the system;
- 3.72 kJ of that energy is transferred out of the system as work
done by the expanding gas;
- 282.1 kJ (the remainder) goes into increasing the "U_eff" of all
of the bonds that have now been broken.

Does this mean that it takes only 282.1 kJ of input energy to
dissociate one mole of H2O if the system were NOT allowed to expand?

Well, this would surely increase the pressure (and possibly the
temperature) quite significantly, causing many of the remaining
bonds to be severely compressed.  This could change existing
bond energies.

My guess is that it would take A LOT more than 282.1 kJ to dissociate
1 mole of H2O this way.

Given that chemical reactions almost never take place this way,
we rarely worry about such considerations.


So, for the (spontaneous) exothermic chemical reactions, what
form of energy is the chemical energy converted to?

Normally, we would expect most of the chemical energy to be converted
to thermal energy within the system itself.

If there is a lot of thermal energy released, there could be a
significant increase in temperature, which may cause the system to
expand.

We already know that expanding systems can do work on the environment,
so there may be mechanical energy output from such a reaction.

An explosive reaction could actually release quite a bit of mechanical
energy (e.g: internal combustion engine; gun powder).

More controlled chemical reactions may also be involved in
thermodynamic processes (e.g., heat engines), which may convert a
portion (not all) of the thermal energy into mechanical energy.

A chemical reaction taking place within a chemical battery can also
result in the transport of electrons from one electrode (+) to another
(-) in the battery.  This creates a charge separation that can be used
to drive electric current through a circuit -- this current can run a
motor, which does mechanical work.  The charge separation is
maintained by the chemical reactions.

Naturally, the ability to generate mechanical (human-scale) energy is
very important to us.


For reference:

From http://hyperphysics.phy-astr.gsu.edu/hbase/thermo/electrol.html
  H2O --> H2 + (1/2)O2  (dH = +285.83 kJ/mol -- endothermic)
  W = PdV = (101.3 x 10^3 Pa)(1.5 moles)(22.4 x 10^-3 m3/mol)(298K/273K) = 3715 J
  dU = dH - PdV = 285.83 kJ - 3.72 kJ = 282.1 kJ

This is me attempting to explain to myself what is happening...

For any process,
  dU = TdS - PdV + sum_j mu_j dN_j

Definition: H = U + PV
In general,
  dH = dU + PdV + VdP = TdS + VdP + sum_j mu_j dN_j
At constant pressure (dP=0),
  dH = TdS + sum_j mu_j dN_j

Suppose 1 mole of H2O is converted to 1 mole of H2 and 1/2 mole of O2.
Suppose this occurs at constant pressure.

Presumably, this process will require
  sum_j mu_j dN_j = +285.83 kJ
This represents the increase in chemical energy, stored as electrostatic
PE and KE of electrons involved in bonds.

During this process, 1.5 moles of ideal gas is created at 298K and 1atm,
causing the system to expand in volume, doing an amount of work
  PdV = 3.72 kJ
on its environment.

If there is no other energy transfer in or out of this system other
than the work mentioned, then
  dU = TdS - PdV + sum_j mu_j dN_j = -PdV
which implies an entropy drop according to
  TdS = -sum_j mu_j dN_j = -285.83 kJ
This entropy drop would occur because of a decrease in thermal energy
of the system, owing to the fact that chemical energy has increased.
Chemical energy is a PE, and so there is no entropy associated with
this form of energy.

This reaction cannot occur spontaneously, because universal entropy
drops are not allowed!

It is therefore necessary, in order to execute this process, that
+285.83 kJ of thermal energy be transferred into the system from
infinite temperature (or more than that from a finite, higher
temperature, to prevent dS_univ from being negative).  An example
would be to run an electrical current through this system, depositing
285.83 kJ into the system from a PE source.

In this case, TdS = 0 for the system, balanced by the heat transfer
in and the thermal energy that is converted into chemical energy.

It follows that
  dU = TdS - PdV + sum_j mu_j dN_j
     = 0 - 3.72kJ + 285.83kJ = +282.1 kJ

To summarize, 285.83kJ of energy is added to the system (as a heat
transfer from infinite temperature; or a mech E transfer) from the
outside, in order to induce this reaction.  The system expands as
gas is produced, transfering 3.72 kJ of mechanical energy back out
of the system.  The system overall gains 282.1 kJ of energy.

Note that, since this process occurs at constant pressure, that
  dH = dU + PdV = sum_j mu_j dN_j = 285.83 kJ
which is why dH is used as a measure of heat of reaction.

It follows that the process being considered in the article cited
above includes not only the chemical reaction, but the energy
transfer into (or out of, in the case of exothermic reactions) the
system, which provides the energy needed for the chemical reaction.
This is required in order to achieve entropy balance (TdS = 0).
It follows that
  dH = sum_j mu_j dN_j = net increase in chemical energy

======================================================================

*** The nature of non-conservative forces
**** The nature of fundamental forces

A "fundamental force" (or perhaps more properly, a "fundamental
interaction") represents a (basic) interaction between fundamental
(elementary) particles.

In spme sense, fundamental forces represent the building blocks for
all interactions, in much the same way that elementary particles
(e.g., electrons and quarks (which make up protons and neutrons))
represent the building blocks that make up all objects.

The list of fundamental forces is rather short:
- Gravity
- Electromagnetic
- Weak (radioactive)
- Strong (nuclear)

There may be other fundamental forces that are conjectured to exist
(related to dark energy, etc?), but these are the four that are known
for sure.

The fundamental forces are to be contrasted with other forces, such
as friction and various elastic forces (including spring, normal, and
tension forces).

These latter forces are "emergent forces" -- the result of interactions
(usually electrostatic) among large numbers of elementary particles.


What does any of this have to do with Conservation of Energy?

Recall that when we were first studying conservation principles, that
we identified momentum and energy both as universally conserved
quantities.

For Conservation of Momentum, we were able to derive the universal
conservation principle from Newton's Third Law, first in the context
of collisions, and later more generally.

However, we have yet to come up with any kind of definitive derivation
for Conservation of Energy.

The reason for this is that Conservation of Energy (unlike momentum)
goes beyond Newton's Laws.

So, what is the fundamental fact, which (when added to Newton's Laws)
leads to Conservation of Energy?

It is this one:

---- ALL FUNDAMENTAL FORCES ARE CONSERVATIVE ----

(Note: In QM, all interactions are described by giving PE function;
non-conservative forces have no place in QM)


To understand how this simple, yet profound fact leads to Conservation
of Energy, let us adopt (at least for now) "fundamental particle view".

In this view, we will ignore the distinction between mechanical and
non-mechanical energy, and consider ALL energy as mechanical:
  E_tot = (KE of all elementary particles)
         + (PE associated with all fundamental forces)

We are effectively setting d_mech->0.

The distinction between mechanical and non-mechanical energy is
a manufactured distinction anyway.  It only makes sense because
(1) We are very big (many many atoms).
(2) We are cognitively limited (we cannot follow individual atomic motions).
(3) We don't see very well (at least not at the atomic scale).

If you recall from our discussions of "system view" that for any
system of particles
  DE_mech = W_ext + W_int,nc

Given that all fundamental forces are conservative and that all
energy is mechanical, this statement reduces to
  DE_tot = W_ext

This statement is the hallmark of a universally conserved quantity.
The only way for that quantity to change for a given system is for
there to be transfers into / out of the system.

If we consider the universe as a system, then there is no "external",
and so
  DE_univ = 0

The total energy in the universe never changes.

In the videos that follow, we will discuss why various emergent
forces (such as friction) appear to be non-conservative.


I want to end this video by pointing out one important caveat to all
this, which also applies to Conservation of Momentum.

We have not yet discussed the role of fields (which mediate "action at
a distance" forces) in energy and momentum conservation.

In fact, Newton's Third Law is technically not correct when fields
are taken into account (a fact that I have already mentioned).

However, the fields that mediate these forces can, themselves, carry
both momentum and energy, and it turns out that energy and momentum
are both universally conserved quantities.

In fact, all energy turns out to be "field energy" in this context --
particle fields carry the kinetic (and rest mass) energies, and force
fields carry the potential energy of the forces they represent.

We will discuss fields later, and hold off further discussion of
their role in energy and momentum conservation until then.

======================================================================

**** Friction

Certain forces, such as friction (and other forces we will consider
later), are emergent forces which represent the large-scale effect
caused by many interactions among a large number of elementary
particles.

The fact that these forces can appear to be non-conservative owes to
the fact that energy can (and often will) be transferred from
mechanical modes (large-scale motions) to non-mechanical modes
(small-scale motions) as a result of these interactions.

In this video, we will look at friction in some detail to understand
how this happens.


So, let's remind ourselves what friction is.

Surface-to-surface kinetic friction is the result of bonds continually
being formed and broken as a result of two surfaces sliding past each
other.

[diagram showing surfaces, and bonds between them]

The bonds in question are between the atoms on one surface and the atoms
on the other surface.

If we look at a specific pair, the atoms will attract each other for
the duration of the bond (as if attached together by a stretched
spring).

When the bond initially forms, the two atoms are approaching each other,
so such a force would seem to increase their relatice speed.

[show bond formation between two atoms approaching each other]

When the bond is about to break, the two atoms are receding from each
other, and so the force would seem to decrease their relative speed.

[show bond breakage between two atoms receding from each other]

Given this back-and-forth action and the chaotic nature of these bonds
in general, it would seem that these forces would be as likely to
point in one direction as another, and would therefore average out to
zero (no large-scale effect).

However, this is not how things play out statistically.

While the inter-object bond is in place, the two involved atoms exert
forces on each other.

Each of these atoms is also bonded to their respective neighbors within
their own lattice.

Thus, this inter-object bond can change the relative motion between each
atom and the atoms that make up their own object.

In other words, the bond can result in energy being transferred into or
out of a short-range vibrational mode of each object.


So, which way does it go?

Well, clearly if there is currently NO thermal energy (no energy in any
of the short-range vibrational modes), then the energy in those vibrational
modes can only increase.

And that energy must come from the mechanical energy associated with the
large-scale motions of the system (i.e., the macroscopically observable
relative motion of the two objects).

After all, total energy IS conserved (bond forces are conservative).

However, no thermal energy implies T=0, and that is very unlikely.

What if there is thermal energy already?
Could it decrease as a result of friction?

That turns out to be very unlikely.

According to the Equipartition Theorem, thermodynamic equilibrium for
a system is achieved when all modes are equally populated with energy.

When there is significant relative motion between two objects, certain
mechanical (large-scale) modes are carrying a significant amount of
energy, when compared to the short-range vibrational modes.

Example: m=10kg, v=5m/s --> K = (1/2)mv^2 = 125J >> (1/40)eV

One of the fundamental principles of Thermodynamics is that systems,
when left to themselves, tend to move towards thermodynamic
equilibrium, and therefore, towards equal distribution of energy
among the different modes.

The bond formation/breakage provides a mechanism by which energy
redistribution can occur.

The rules of Thermodynamics does the rest.

If we place the block and floor in its own system, the relative motion
of the block and floor will inevitably decrease to (nearly) zero as
the bond formation/breakage continues to occur.

Keep in mind that there are MANY more non-mechanical modes than
mechanical modes, so virtually all of the redistributed energy will
end up in the non-mechanical (thermal) modes.

Note that the conversion from mechanical to thermal energy will
result in an increase in temperature for the involved objects.

Friction is a HEATING process.


It should be noted that the same thing happens when a solid object
passes through a fluid (gas/liquid) -- random collisions /
interactions with fluid molecules will cause mechanical energy to
convert to thermal energy, leading to a non-conservative drag force.

In the next video, we will look at other "dissipative" emergent
forces, and examine this phenomenom more deeply.

======================================================================

**** Other dissipative forces

Any emergent force which can lead to short-range relative motions will
likely have a degree of non-conservativity, due to the conversion of
mechanical energy into thermal energy.


Common examples include normal/tension/spring forces.

For example, when two objects collide, the relative motions of
adjacent atoms making up each of the colliding objects may be
significantly increased as a result.

This would result in a non-conservative collision force, and thus, an
inelastic collision.

In practice, for most hard materials (e.g., metals, ceramics, etc),
this effect is actually negligible, but if the collision is violent
enough (i.e., with enough speed), bonds may permanently break as a
result of the collision.

The added atomic-scale PE of a broken bond ultimately comes from the
available mechanical energy before the collision, resulting in an
inelastic collision.


This same thing can happen with springs that are overstretched, even
"non-violently" -- bonds may break as a result of overstretching.

Even if springs are NOT overstretched, there may be leakage of
energy into the non-mechanical modes due to the constant stretching
and compressing of the spring.

The relative motions of the adjacent atoms in the spring due to this
continuous oscillation will, over time, result in an increase in
thermal energy at the expense of the oscillation.

This leakage is usually quite neglible in practice, however, so the
dissipation of mechanical energy can take a long time.

[demo 450.m4v -- takes minutes to dissipate, even WITH air resistance]


Technically, a sound wave passing through an object is a large-scale
vibrational mode, and normally counts as mechanical energy (as long as
its half-wavelength exceeds d_mech).

Sound, however, will eventually dissipate and convert into thermal
motions at the expense of the longer-range sound wave.

(may actually be related to imperfections in the lattice structure, and
scattering of sound waves.)

(Note: this is different from the 1/r^2 decrease in sound intensity
from a point source due to the spreading of the wave.)


Even gravity can be non-conservative to some degree when tidal forces
are accounted for.

The term "tidal forces" refers to the fact that gravity can be
non-uniform.  Such non-uniformity can result in relative stresses
within an object subject to gravity.

The classic example of tidal forces is the tides, which result from
the moon's gravitational pull on the earth and all of its belongings,
including its oceans.

Since the earth is quite large, there is a non-negligible difference
in the gravitational pull from the moon on the parts of the earth
closest to the moon, at the center of the earth, and farthest from the
moon.

These differences in force strength are responsible for the tides,
which result in relative motion between the oceans and the rest of the
earth.  This motion of the water is on a much smaller scale than the
earth itself.

[diagram showing elongation that results]

Not only does this relative motion result in frictional losses in this
particular example, but the moon/earth tidal forces can serve as a model
to illustrate what can happen on a smaller scale when two objects of
finite size attract one another gravitational.

Such inequity in gravitational pull can result in relative motions
between adjacent atoms, which lead to the generation of thermal energy
at the expense of mechanical energy.


It is clear that friction and other dissipative processes are very
common.

This is because the conditions required for dissipation are so
minimal: any interaction that can transfer energy to and from
small-scale vibrational modes will likely result in a conversion from
mechanical to thermal energy.

This explains why friction is so hard to completely eliminate.

======================================================================

**** Conservative emergent forces

Given how ubiquitous dissipative processes are, it is remarkable that
there would be examples of emergent (large-scale) forces that ARE
conservative, or nearly so.


The classic (and provable) example is uniform gravity.

Consider a projectile launched near the surface of the earth, where
g = 9.80 m/s^2, (essentially) independent of position.

If the particles that make up the object are initially moving at the
same initial velocity (no relative motions), and no other (external)
forces are acting), then all particles will accelerate with the same
acceleration, and thus will follow entirely parallel paths --
maintaining their relative positions and motions.

Under perfect conditions, there would be no need for bonding forces --
a carefully launched ball of sand will paintain its shape throughout
flight.

Of course, real (solid) objects are held together with atomic bonds,
and there is temperature, so thermal vibrations will be happening.

However, those vibrations will be unaffected by the gravitational
force.

One way to see this is that, since g is uniform over the system, an
observer moving with the system (i.e., cm frame) will observe the
object in weightlessness, much the same as the astronauts in an
orbitting space station.

By the way, real objects near earth are also travelling through the
atmosphere, which WILL affect relative motions of adjacent atoms as
air molecules strike the object -- but this is due to the frictional
nature of air resistance, not directly because of gravity.

Another way to see that uniform gravity would have no impact on
internal motions is to calculate the work done by gravity on the
system in detail:
  W_sys = sum_i W_i
        = sum_i int m_i g dot v_i dt
        = int g dot [sum_i m_i v_i] dt    [g factors because it is uniform]
        = int Mg dot v_cm dt
        = W_cm     [only cm motion matters]
  W_rel = sum_i W_i,rel
        = sum_i int m_i g dot v_ic dt
        = int g dot [sum_i m_i v_ic] dt
        = int g dot [0] dt
        = 0

This implies that work done by gravity can be calculated as if the
object were a particle, and thus our previous calculations of work done
by gravity, and the fact that gravity is conservative, are completely
valid.

(We will see later that our "object view" calculation of W_fk will
need to be reconsidered.)

Note that g only needs to be uniform over the system as it is at any
one time -- it need not be constant over the entire span of the motion
of the object.

Thus, an object can travel over a large range where g varies
(e.g., launch an object into space, eventually moving very far away),
as long as the object itself is small enough.


Another emergent force that is (in practice) conservative at the large
scale is the spring force, especially in a "non-violent" context,
where the spring is not over-strained, or strained too suddenly (as in
a collision).

The normal force and tension force, under normal (i.e., non-collision)
conditions, will also fit into this category.

Even though one might think that small-scale vibrational modes could
be excited by spring-mass oscillations, the fact is that this form of
dissipation is normally very small, and only happens over very long
time periods.

[demo 450 -- even with air resistance, takes several minutes for amplitude
to drop]

We will explain why this makes sense more carefully in the next video
when we discuss "protected modes".


We already discussed how violently strained elastic systems (i.e.,
collisions) can result in a transfer of energy to small-scale thermal
vibrations, and how over-straining can lead to broken bonds.

However, it is also possible that collisions between ordinary objects,
especially hard objects (e.g., metal/ceramic balls) can be very nearly
elastic, with coefficients of restitution that are near 1.

This is especially true at low velocities where bonds are not
permanently broken.


What about electromagnetic forces?

We will be discussing these forces in more detail later, although we have
already been introduced to Coulomb's Law (inverse square law) for charges.

Unlike elastic forces (and like gravity!), electromagnetic forces are
actually fundamental forces.

These interactions generally occur on a macroscopic scale when objects
have a significant electric charge and/or a significant magnetic dipole
moment (so-called charged and/or magnetic objects).

The "emergent" version of this force arises ultimately from a
combination of smaller scale forces among the constituent atoms (i.e.,
every electron/proton in one object interacting with every
electron/proton in the other object).

Are such forces conservative at the macro level?

Usually very nearly so.

Yes, there can be relatively different forces exerted on adjacent
atoms (in fact, that is more likely with EM forces than with gravity,
even non-uniform gravity, because EM forces are not
mass-proportionate), but in practice these effects are usually quite
small.

E.g.: One might charge a ball (consider its surface as charged), and then
attract or repel it with (e.g.) a charged rod.  [diagram / demo 460.m4v]

The surfaces interact and push/pull each other.

Ultimately, this force is transferred to the objects as a whole through
interatomic forces within each object, in much the same way that a normal
force (a surface force) can result in an entire object accelerating.

There may be compressions/stretchings that occur.

But these usually have little effect on mechanical energy, as explained
previously.

BTW: Magnetic bumpers make very good elastic bumpers in collision
experiments.


Non-conservativity of EM forces usually comes about when electric
currents are involved.

Actually, the base interaction is conservative, but the resulting
electric current will often result in thermal energy as the moving
electrons collide with the atomic lattice.

(Note: the forced motion of electrons through a wire due to an
electric field (generated, e.g., by other charges) behave more like an
object falling at terminal velocity, rather than free-falling.)

Large metal objects moving near large magnets can result in large
"eddy currents", which will result in a loss of mechanical energy.
[demo 530.m4v -- if available]

Non-conservativity can also result when there are very highly
accelerated charges, resulting in the emission of EM radiation
(fields).

======================================================================

**** Protected modes (conservation of momentum; angular momentum, etc)

To understand how mechanical energy can be (largely) conserved in
macroscopic processes, despite the seemingly ubiquitous presence of
dissipating processes, it pays to look at this in more detail.

According to the equipartition theorem, when a system achieves
thermodynamic equilibrium, the energy of that system should be
distributed evenly among all of its normal modes (K,U = (1/2)kT per
mode).  There are quantum corrections, but this is essentially the
rule.

One important caveat: in order for energy to distribute evenly in
this way, there must be SOME mechanism for energy to transfer from
one mode to another.

If that mechanism does not exist, thermodynamic equilibrium can be
achieved without such an even distribution of energy.

(Note: energy would distribute evenly among all modes that are
mutually "connected" by such a transfer mechanism.)


Consider the center-of-mass (translational) modes of a completely
isolated system.

In this case, since the system is isolated, momentum is conserved.

This implies that v_cm cannot change, and therefore K_cm cannot
change either.

The energy currently stored in the cm (translational) modes is
essentially locked in that mode.  This is a protected mode (protected
by Conservation of Momentum).

Actually, for a single isolated object, K_cm can be "legislatively"
reduced to zero simply by examining that system in the cm reference
frame.

However, consider an isolated system consisting of two (or more)
subsystems (i.e, separate objects), each completely isolated from the
others.

In that case, v_cm for each subsystem (i.e., each object) cannot
change, and so K_cm for each subsystem cannot change.

Thus, if we have a system of several non-interacting objects, the
KE of each object cannot change.

This KE cannot be eliminated simply by changing the reference frame --
the best we can do is adopt the overall cm (entire system) reference
frame.

What we called "K_rel" (when we were still ignoring atomic motions)
would still be preserved.

This large-scale KE (what we, in the past, considered to be the
kinetic energy of each object, treated as a whole) is mechanical
energy, and would not be able to change.

The rotational modes for completely isolated systems/subsystems
are likewise protected by Conservation of Angular Momentum, in
much the same way.


What about vibrational modes?  Are they protected?

Actually, it turns out that ALL modes are protected, including the
vibrational modes, as long as the system is completely isolated.

We defined a normal mode to represent a stable motion -- not changeable
over time, unless there are external interactions.

Vibrational normal modes oscillate with a definite frequency, and maintain
their relative motions for all time.

Solving for the normal modes involves imposing this "single frequency"
condition on the system as it undergoes its motion [see "Large systems 1-D"].

This explains why mechanical energy is normally conserved in simple
large-scale spring-mass oscillations:
  [demo 450.m4v]


Of course, this "protection" only exists if the relevant
systems/subsystems are COMPLETELY isolated.

This ideal is almost never realized -- there are bound to be some
interactions with external systems.

In the case where there are external interactions, generally all
normal modes within a given system are accessible to each other --
energy can transfer from one mode to any other mode.

If the interactions are very weak, however, this transfer may
take a long time.

(Technically, such a system would not achieve true thermodynamic
equilibrium until the energy has completely spread uniformly.)

Thus, it can take some time for friction to act.

There may also be slow leakage from large-scale vibrational modes.

[demo 450.m4v]

The spring is attached to a rope -- the rope stretch will change
slightly as the spring-mass oscillates, allowing a small amount of
energy to leak out of the spring-mass system into the rope (and
beyond).


Interesting rabbit hole -- using disconnectivity trees to understand
energy redistribution over different time scales.

======================================================================

**** Creation of mechanical energy

We have been discussing mechanisms by which mechanical energy can
be converted into non-mechanical energy.

A reasonable question: can the process go the other way?

Is it possible to convert thermal (and other non-mechanical) energy
into mechanical energy?

The answer, as it turns out, is a qualified yes, but there are limits
imposed by thermodynamics.

(BTW: if the answer were no, there would be nothing interesting
happening in the universe right now -- friction would have killed off
all interesting motions long ago.)


To better understand these limitations, it pays to dig a little
deeper into thermodynamic principles, and introduce an important
concept from thermodynamics: the concept of ENTROPY.

Entropy is often defined as a measure of disorder in a system.

Entropy can also (formally) be defined in terms of the number of
atomic states consistent with a given macro-state (e.g., how many
different ways are there to arrange the atoms in a glass containing
500ml of water at 300K).

These definitions (disorder vs. # atomic states) are quite related.

Classic example:
- a whole plate is relatively low entropy (ordered state)
- a broken plate is relatively high entropy (disordered state)

How many ways can atoms be arranged in a whole plate?  Once a few
atoms are positioned, the positions of the remaining atoms are
pretty much fixed.

For a broken plate, each piece can be positioned, oriented, and can
move independently, resulting in many more atomic states.

If a whole plate is dropped on the floor, it tends to break:
  low entropy --> high entropy

If a broken plate is dropped on the floor, it is very unlikely that
the plate will spontaneously reassemble into a whole plate -- the
numbers do not favor this.

Instead, a broken plate will likely break even more
  high entropy --> even higher entropy

According to the Second Law of Thermodynamics, the total entropy of
the universe is increasing.

There is a tendency for systems to progress from lower entropy to
higher entropy.

It is a pure numbers game.


How does the entropy concept relate to the equipartition theorem?

Suppose we have an object consisting of a large number of atoms.

Suppose that there is significant mechanical energy, which implies
that most of the energy of this system is locked up into a relatively
small number of mechanical modes (e.g., translational modes).

With most of the energy locked up in a relatively few mechanical
modes, there is very little flexibility in how the remaining energy is
distributed -- this arrangement tends to be relatively low entropy.

If the energy were redistributed more evenly among all modes
(mechanical and non-mechanical), there is much more flexibility in
that kind of distribution -- this arrangement tends to be relatively
high entropy.

Thermodynamic equilibrium is achieved when entropy reaches its
maximum value -- this occurs when all modes are equally populated
with energy (equipartition theorem).


Since virtually all of the normal modes are non-mechanical (thermal),
it is clear that the overwhelming trend is for mechanical energy to
be converted into thermal energy.

Mechanical energy is a very low entropy form of energy (practically zero).

Thermal energy is a high entropy form of energy.

Given this overwhelming trend to convert mechanical energy into
thermal energy, how is it possible to produce mechanical energy?

In a thermodynamic context, the usual way of producing mechanical
energy is to have a system (one that is under pressure) expand.

When such a system expands, it pushes outward on its surrounding,
doing positive work.

The motions produced by this expansion are large-scale, and hence
mechanical in nature.


One way to generate this expansion is through an uncontrolled, highly
exothermic chemical reaction: an explosion.

For example, gasoline is placed into a spark chamber inside of a
car engine -- the gas is ignited, causing an explosion, which pushes
on a piston.

That piston is connected to a gear system, and ultimately, to the wheels
of a car, causing them to rotate.

Static friction between the tires and road will then cause the car to
accelerate forward.

This represents a conversion of chemical energy to mechanical energy.

Chemical energy actually represents a low entropy form of energy (even
though it is atomic scale), since it is a potential energy (not
available for motion).

Therefore, there will be a (quite significant) increase in entropy
as this chemical energy is converted into thermal energy.

Given the arrangement of the system, part of this "explosive energy" is
converted into mechanical energy, another low-entropy form of energy.

The bulk of the energy released by the chemical reaction remains as
thermal energy.

(Note: the engine will need to be cooled.)

Overall, there is a net increase in entropy.


Note that a process which attempts to directly convert thermal energy
into mechanical energy will result in an overall decrease in entropy,
a clear violation of the Second Law of Thermodynamics.

To avoid violations of the Second Law of Thermodynamics, mechanical
energy must be produced in conjunction with some other process which
normally increases entropy, so that there is a net overall increase
entropy, in spite of the production of mechanical energy.

Such an entropy-producing process can be thought of as generating
the necessary "thermodynamic pressure" to drive the production
of mechanical energy.

In the case of an explosion, the entropy-producing process is the
explosion itself -- a highly exothermic chemical reaction greatly
increases the entropy.

A more controlled thermodynamic process involves the production of
mechanical work by taking advantage of the tendency for heat to
flow from a hot object to a cold object.

The idea is to draw heat from a hot object, allowing part of it
to flow into a colder object, and the rest gets output in the
form of mechanical work.  [heat engine diagram]

The efficiency of such an engine (i.e., fraction of energy drawn
from hot object that can be converted to work) is limited by the
Second Law of Thermodyanics.

This type of process is called a heat engine, and will be studied
in more detail when we discuss thermodynamics in detail.


Note: the earth is one giant heat engine:
  6000K (sun) -> 300K (earth) -> 2.7K (universe)
Heat flowing along these temperature gradiants provides plenty of
"thermodynamic pressure", driving the production of chemical energy
(e.g., photosynthesis) and mechanical energy (e.g., wind power).

Chemical energy can also be converted to mechanical energy within
organisms (such as us).

======================================================================

*** Heat flow

**** Definition of heat

We are all familiar with what happens when a hot object is placed next
to a cold object -- the hot object cools off and the cold object warms
up.

This represents an energy transfer, but is not normally considered
"work" since no mechanical (i.e., large-scale) force is involved.

In thermodynamics, we call such energy transfers "heat transfers", and
we say that heat flows from the hot object to the cold object.

The First Law of Thermodynamics (energy conservation) is normally
stated as follows:
  DE_sys = Q + W
  Q = net heat flow into system (during the given time interval)
  W = net work done on the system (...)

(Note: there is a commonly used sign convention that asserts that W
should be net work done BY the system on its environment.
 --> DE_sys = Q - W
I do not recommend this convention -- it is utterly confusing.)


Normally, in the course of studying thermodynamics, emphasis is
placed on the internal (vibrational) modes of an object, and so
the First Law normally reads
  DE_int = Q + W

This specifically ignores overall translational and rotational
motion of the object.

Forces which might affect such motion are disregarded or assumed to
not be present when calculating W -- only the "pressure force" is
included (W = -P DV).

It isn't that such "thermodynamic objects" can't move or rotate --
it is merely a convenience of study -- one thing at a time.

======================================================================

**** Heat as atomic scale work

There are a number of mechanisms of heat transfer:
- convection
- conduction
- radiation

Convection is fairly straightforward: energy is transfered by having
the hot material physically move to where the cold material is
(perhaps mixing).

Normally convection involves fluids (liquid or gas) where other forms
of heat flow (such as conduction) are inefficient.

Radiation involves the emission and absorption of EM radiation, and
involves the energy content of the electromagnetic field -- this is
beyond the scope of what we are discussing here.

Conduction is more interesting -- here the heat flow is related to
neighboring atoms bumping into each other, and transferring energy
that way.

This mechanism normally involves solid objects where atoms are close
enough together and bonded to make this form of transfer efficient.

It should be noted that convection may also involve conduction
to some degree:
  hot object --> surrounding air --> cold object
The two links shown are conduction steps, but there is a middle
convection step as the heated air actually travels from the hot object
to the cold object.


Let's dig into heat conduction more deeply...

Consider two objects in physical contact -- one hot and one cold.
  [diagram]

The atoms vibrate more quickly in the hot object.

The atoms vibrate less quickly in the cold object.

Atoms from the hot and cold objects (along the contact surface)
are interacting with each other.

Forces are involved at the atomic level.

Presumably the forces will tend to point randomly, and thus should
add up to essentially zero at the large scale (no large-scale motion).

The work done by an individual force is likely NOT zero.

Given the random nature, one might expect that work is as likely to
pass in one direction as the other, given the random nature of these
forces.

That turns out to be wrong: work done by these forces tends to pass
in one direction: from hot to cold.
  W_cold > 0 ; W_hot < 0

(this should be obvious if one of the objects is zero temperature)

This is simply the Equipartition Theorem doing its thing --
redistributing energy equally among all accessible modes associated
with BOTH objects.

This "atomic scale work" is heat flow.

Because there are no large-scale (mechanical) forces involved, it does
not make sense to disignate this as "work" in the usual sense.

======================================================================

**** Thermodynamic definition of the work done by friction

I want to finish things up here with a thermodynamically accurate
treatment of the work done by friction.

Suppose (to keep things simple) we have a block sliding along a floor
with uniform friction.

[diagram]

In our initial treatment of energy/work (object view), we showed that
the work done by friction on the block is given by
  W_fk = -fk d
and that this work contributes to a decrease in the mechanical energy
of the block by the same amount.

While the decrease in mechanical energy by fk d is accurate, it turns
out the first statement is not when "work done by a force" is
carefully defined.


Here is the issue...

The block will slide along the ground, eventually slowing to rest
because of friction.

The (large-scale) kinetic energy of the block is converted to thermal
energy.

During the slow-down period, some of that thermal energy will be
generated in the floor, and some of that thermal energy will remain
with the block.
  T_floor = amount of thermal energy generated in the floor
  T_block = amount of thermal energy generated within the block

The generated thermal energy will eventually spread out across the
vast planet we live on (the block + floor will return to "room
temperature").

[diagram sequence:
  A:initial time: block room temperature (show atomic vibrations) and moving
  B:stopping: block at rest; elevated temperature of block + nearby floor
  C:long time: block at rest, room temperature; thermal energy spread out]

Simplifying assumption: time scale involved with slow-down is much
shorter than time scale involved with heat flow -- heat flow is
actually rather slow.

As a result:
  Net decrease in mechanical energy of the block
    = fk d
  Net decrease in total energy of the block during slow-down
    = fk d - T_block  (= T_floor)
  Net decrease in total energy of the block, long term
    = fk d

During the short slow-down period, there is no time for heat flow.

Thermal energy created within block hasn't yet left the block.

Therefore, the work done by friction (total work done by atomic forces)
is not -fk d, but is less negative by the amount of thermal energy
retained by the block.
  W_fk = -[fk d - T_block] = -T_floor

Note: the discrepancy between this result and our object-view
calculation is NOT due to the fact that heat flow is not counted
as a form of work -- no heat has flowed yet.

The discrepancy is due to the fact that, in the object-view
calculation, we did not consider the motion (relative to the block cm)
of the block atoms involved in friction bonding.


Mathematical calculation...

For the moment, we will adopt elementary particle view and treat the
block as a system of particles (atoms).  This will blur the distinction
between mechanical and non-mechanical energy.

  K_cm = (1/2) M vcm^2     (object view KE)
  K_rel = sum_i (1/2) m_i v_ic^2    (contribution to thermal energy)
  W_cm = sum_i int fki dot v_cm dt   ("fki" = net bonding force on atom i)
       = int fk dot dr_cm
       = -fk d
  W_rel = sum_i int fki dot v_ic dt
        != 0

  DK_cm = W_cm
        = -fk d   (based on object-view calculation and discussion)
  DK_rel = W_rel
         = W_rel,ext + W_rel,int  (floor-block bonds + block-block bonds)
         = W_rel,ext - DU_sys   (block-block bonds are internal conservative)
  E_sys = K_cm + K_rel + U_sys
                 ^^^^^^^^^^^^^----- virtually all of this is thermal energy
  DE_sys = W_ext = W_cm + W_rel,ext

The non-mechanical (thermal) energy of the block includes (most of)
what we are calling K_rel + U_sys.

This contribution to E_sys changes by W_rel,ext as the block slides.

Thus,
  T_block = W_rel,ext  (work done on block atoms by floor-block bonds relative to block cm)

Note that W_rel,ext = T_block > 0, since the thermal energy of the
block increases due to friction.

The TRUE work done by friction is given by
  W_fk = W_fk,cm + W_fk,rel
       = (-fk d) + T_block
       = -[fk d - T_block]
      (= -T_floor)

It follows that true W_fk is less negative than -fk d.  It represents
the amount of energy lost by the block during the friction process,
which will equal the amount of thermal energy generated in the floor
(not the block).

After some additional time passes, the excess thermal energy of the
block will gradually spread to its environment as a heat flow.

Overall, E_sys will decrease by fk d:
  DE_sys = W_fk + Q
         = -[fk d - T_block] - T_block
         = -fk d


Pedogogical considerations:

Some introductory physics textbooks will try to discuss the
thermodynamically accurate treatment of work done by friction, or at
least explain why that work is not actually -fk d.

Inevitably, they fail spectacularly because they don't have time to
go into the level of detail that we have gone into.

Some introductory physics textbooks will avoid discussion of work
done by friction altogether, just to avoid this problem.

This creates its own issues: somehow friction is SPECIAL, in that we
never discuss work done by friction, but somehow it affects mechanical
energy.

I made a conscious decision to discuss (object-view) work done by
friction earlier on, knowing that this discussion would need to be
amended later if we wanted to account for these issues.

My belief is that this will ultimately create the least confusion.

Note that the true goal of teaching science is to leave the students
with the most accurate sense of how the world works, and to provide
them with the best tools to carry out their own study.

This goal is not always best served by limiting ourselves to 100%
accurate statements.

It is not what we say that matters; it is what they think afterwards
that counts.

======================================================================

*** Epilog -- the role of thermodynamics in atomic-view energy

The world of elementary particles is truly a simple place.

We ignore the mechanical energy cutoff and all of the statisical
analysis of large systems, and treat all particle motions equally.

When we do this, we have particles interacting entirely via
conservative (fundamental) forces:
- Gravity
- Electromagnetic
- Weak (radioactive)
- Strong (nuclear)

All energy is mechanical (we are ignoring fields for now):
  E_sys = K_sys + U_sys

All energy transfers between systems occur as a result of work done by
external forces:
  DE_sys = W_ext

Total energy is obviously conserved.

The complexity of real-world energy analysis arises when we try to
summarize the behavior of large systems of small particles in terms of
large-scale behaviors and trends.

It is at that point that non-mechanical forms of energy
  thermal, chemical, ...,
as well as non-conservative forces
  friction, collision, ...,
and heat transfers, and the concept of temperature itself, etc, need
to be introduced.

Thermodynamics is the price we pay for the right to summarize.
